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A SYSTEM AND METHOD FOR ANALYZING CONTENT ON A WEB PAGE 

USING AN EMBEDDED FILTER 

5 FIELD OF THE INVENTION 

The present invention relates to the field of web page content. 
Specifically, the present invention relates to analyzing content on a web page 
using an embedded filter. 

10 BACKGROUND ART 

In 1998, Congress amended the Rehabilitation Act to require Federal 
agencies to make their electronic and information technology accessible to 
people with disabilities stating, "inaccessible technology interferes with an 
individual's ability to obtain and use information quickly and easily/' 

1 5 Therefore, Section 508 was enacted to eliminate barriers in information 

technology, to make available new opportunities for people with disabilities, 
and to encourage development of technologies that will help achieve these 
goals. The law applies to all Federal agencies when they develop, procure, 
maintain, or use electronic and information technology. Under Section 508 (29 

20 U.S.C. ' 794d), agencies must give disabled employees and members of the 

public access to information that is comparable to the access available to others. 
Moreover, it is important for companies to also adhere to the Section 508 
standards to ensure their electronic and information technology may be 
maintained, procured, and used by any federal agencies or other groups bound 

25 to the same standards. 



ORCL-2003-1 56-01 /ACM/JDY 



ORACLE CONFIDENTIAL 

In order to ensure that electronic content conforms to the rules of 
Section 508, the product (e.g., web site, web page, electronic form, etc.) should 
be checked for accessibility. For example, the checking includes ensuring that if 
a page includes an image there is text in the hyper text mark-up language 
5 (HTML) that can take the place of the picture for a browser that does not 

display pictures (e.g., a browser for the blind). In another example, there may 
be tags within a table (e.g., column and row tags) which allow a user to follow 
the table without needing to see it. In yet another example, a check may be 
necessary to ensure that a form field (e.g., user name and password) is correctly 
1 0 coded. That is, when a field is highlighted there is text or other type of prompt 
(e.g., voice) that describes the information requested by the field in question. 
This may include many types of visual and/ or audio HTML tags. 

The current method for analyzing an electronic product (e.g., a static 
1 5 web page) for conformation with Section 508 at the developmental level 

requires an analyzer to search out the web site in question (e.g., on a network), 
analyze the web page and store the results on a database. A report may then 
be requested by the testing party wherein the results stored on the database 
are formatted (e.g., as an itemized list) and returned to the testing party. 

20 

However, the current method is cumbersome, and as technology 
progresses the ability for an analyzer to reach into a network for a specific web 
page is becoming more difficult. For example, in order to more efficiently 
maintain a network, instead of every machine having its own Internet protocol 
25 (IP) address, a block of computers on a network may dynamically allocate the 
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IP addresses using DHCP. That is, the machines connected with the network 
may have IP addresses that change over time. For example, when a computer 
is on the network it may have one IP address, but then when it leaves the 
network it may give up the IP address, and the address may be assigned to 
5 another computer on the network. Thus, there is no need for a network to 
maintain a fixed, large list of IP addresses for each of the computers capable of 
accessing the network. Instead, there may be a pool of IP addresses which can 
cover the same number of computers with a much smaller framework. 

1 0 As the use of DHCP grows, actually finding a machine on a network 

becomes more difficult. Instead of simply inputting a network address, 
accessing a domain name server, and mapping directly to the machine having 
the network content, due to dynamic allocation of IP addresses, a user must 
now input the machine identifier as well. This is a cumbersome and difficult 

1 5 proposal. In general, machine identifiers are large and inconvenient to 
remember, input, pass on, and the like. 

Therefore, when testing a web page on the network, assurances are 
made that instead of using a domain name, the machine name is input. In 

20 addition, the use of the machine name may not allow the developer access to 
the computer due to possible incorrect mapping, thereby preventing the 
analyzer from finding the machine and ultimately the web page to be analyzed. 
Furthermore, as the number of machines accessing the dynamic network 
increase, the mapping and finding problems associated with finding a single 

25 machine become even more deleterious. 
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Due to the above stated issues, testing electronic and information 
technology to ensure proper operation under Section 508 has become difficult 
and time consuming. Moreover, when a change is made to Section 508, each 
5 previously tested page is found and tested again thereby encompassing 

additional time-consuming mapping, machine naming, and searching to find 
the correct electronic information. Furthermore, the pages are tested statically 
and cannot be tested dynamically. For example, once the information is 
analyzed, a second report is accessed and the web page updated accordingly. If 
1 0 any errors were present, the page must then be tested again after all the 
changes have been made and a third report accessed. 
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SUMMARY OF INVENTION 

The present invention provides, in various embodiments, describe a 
method and system for analyzing content on a web page using a filter-based 
system. Furthermore, the present invention provides a method and system for 
5 analyzing content on a web page which may be performed at near real time in 
the development process. Additionally, the present invention provides a 
method and system for analyzing content on a web page which may be used 
for accessibility checking. 



1 0 Specifically, in one embodiment, the present invention responds to a 

generated web page having content. Additionally, the web page is processed 
through a filter wherein the filter transfers the content of the web page to an 
external analyzer. The analyzer analyzes the content of the web page. The 
analyzer then returns the result of the analyzing to the server which appends 

1 5 the result of the analyzing to the content of the web page. In so doing, the 
ability to provide an analysis of the functionality of a web page at the time of 
development is provided. In one example, the analyzer checks for accessibility 
of the web page, but any rule based checking could be implemented. 

20 For example, in one embodiment, the present invention launches an 

embedded server in a development environment. This server is configured to 
run a servlet filter, which intercepts the request made to the server and the 
response provided by the server. The filter intercepts the data (e.g., HTML, or 
the like) generated by the embedded server, sends the data to a hosted service 

25 of a checker (e.g., accessibility) for processing, and then produces the checker 
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results on the same page as what was analyzed. This generates analysis results 
immediately for the user. In one embodiment, the system provides a mapping 
from what a user is working on (e.g., JDeveloper rapid application 
development item identifiers) and what the servlet filter reports (same 
5 identifiers) so that the caller can find and fix problems in the same environment 
in which they are working (or in a familiar context). 

In addition, the added functionality at development time provides an 
immediate solution to developers to help them develop code more efficiently. 
1 0 These features may apply to any HTML testing of any page, as well as other 
formats including regression testing capture/ playback and image accessibility 
analysis. 

In one embodiment, the system may be used in the development of 
1 5 accessibility coding for standards adherence. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and form a part 
of this specification, illustrate embodiments of the invention and, together with 
the description, serve to explain the principles of the invention. 

5 

FIGURE 1 is a block diagram of an embodiment of an exemplary content 
analyzing system in accordance with an embodiment of the present invention. 

FIGURE 2 is a flowchart of computer-implemented steps performed in 
1 0 accordance with one embodiment of the present invention for analyzing 
character data. 

FIGURE 3 is a diagram of an exemplary web page layout in accordance 
with an embodiment of the present invention. 

15 

FIGURE 4 is a diagram of an exemplary web page layout with appended 
content analysis results in accordance with an embodiment of the present 
invention. 

20 FIGURE 5 is a block diagram of an embodiment of an exemplary 

computer system used in accordance with the present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

Reference will now be made in detail to embodiments of the invention, 
examples of which are illustrated in the accompanying drawings. While the 
invention will be described in conjunction with the preferred embodiments, it 
5 will be understood that they are not intended to limit the invention to these 
embodiments. On the contrary, the invention is intended to cover alternatives, 
modifications and equivalents, which may be included within the spirit and 
scope of the invention as defined by the appended claims. Furthermore, in the 
following detailed description of the present invention, numerous specific 
1 0 details are set forth in order to provide a thorough understanding of the 

present invention. However, the present invention may be practiced without 
these specific details. In other instances, well-known methods, procedures, 
components, and circuits have not been described in detail as not to 
unnecessarily obscure aspects of the present invention. 

15 

Some portions of the detailed descriptions which follow are presented in 
terms of procedures, logic blocks, processing, and other symbolic 
representations of operations on data bits within an electronic computing 
device and/ or memory system. These descriptions and representations are the 

20 means used by those skilled in the data processing arts to most effectively 
convey the substance of their work to others skilled in the art. A procedure, 
logic block, process, etc., is herein, and generally, conceived to be a self- 
consistent sequence of steps or instructions leading to a desired result. The 
steps are those requiring physical manipulations of physical quantities. Usually, 

25 though not necessarily, these physical manipulations take the form of electrical 
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or magnetic signals capable of being stored, transferred, combined, compared, 
and otherwise manipulated in a computer system or similar electronic 
computing device. For reasons of convenience, and with reference to common 
usage, these signals are referred to as bits, values, elements, symbols, 
5 characters, terms, numbers, or the like with reference to the present invention. 

It should be borne in mind, however, that all of these terms are to be 
interpreted as referencing physical manipulations and quantities and are merely 
convenient labels and are to be interpreted further in view of terms commonly 

1 0 used in the art. Unless specifically stated otherwise as apparent from the 
following discussions, it is understood that throughout discussions of the 
present invention, discussions utilizing terms such as "partitioning", 
"receiving", "processing", "creating", "storing", "delivering", "accessing", 
"generating", "providing", "separating", "outputting", "performing", 

1 5 "marking" or the like, refer to the action and processes of a computer system, 
or similar electronic computing device, that manipulates and transforms data. 
The data is represented as physical (electronic) quantities within the computing 
device's registers and memories and is transformed into other data similarly 
represented as physical quantities within the computing device's memories or 

20 registers or other such information storage, transmission, or display devices. 

With reference now to Figure 1, a block diagram of an exemplary 
content analyzing system is shown in accordance with an embodiment of the 
present invention. In one embodiment, the content analyzing system 100 is 
25 used to analyze a web page at the developer level. That is, a user can analyze a 
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developing (or developed) web page at real time or near real time. The 
analysis may be a measure of the web pages conformity to an established 
standard (e.g., Section 508 (29 U.S.C. ' 794d) of the Rehabilitation Act), or to a 
personal standard, or against any rule-based code. Moreover, the data is 
5 pushed to the analyzer not vice-versa, therefore, the difficulties with dynamic 
IP addresses are moot. 

System 100 includes a computing system 110. Computing system 110 
may be any type of computing system (e.g., a desktop, laptop, handheld, or the 

1 0 like) such as described in Figure 5. In one embodiment, computing system 110 
includes a developer 120, a browser 130, and a servlet 140. In addition, servlet 
140 includes a filter 150. Developer 120 may be any type of web page 
development tool such as JDeveloper (a Java coding tool), or the like which 
may be used on a computing system to construct, modify, evaluate, enhance, 

1 5 reduce, or otherwise manipulate content on a web page. For purposes herein, 
a web page may be any type of electronic content to include forms, graphics, 
links, information, tables, and the like which may incorporate a plurality of 
computer code (e.g., HTML, Java, JPEG, MPEG, Applets, and the like). 

20 With reference still to Figure 1, in one embodiment, browser 130 is any 

program which may be used to access content from a web page. In one 
embodiment, servlet 140 is a web server which is utilized to run the page the 
coder or developer 120 has written (or is writing). In one embodiment, servlet 
140 is a small web server which also operates on the same computing system 

25 110 as the developer 120 and the browser 130. Additionally, servlet 140 is 
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configured to run a filter (or servlet filter) 150 which in one embodiment is a 
piece of code that operates in a J2EE environment. 



Also in system 100 is an analyzer 160. In one embodiment, analyzer 160 
5 operates on a separate computing system. In general, analyzer 160 is a tool 
used to test an application's HTML (.jar, .zip, Java, JPEG's, MPEG's, or other 
content on a web page). In one embodiment, the analyzer 160 takes an HTML 
file or stream as input, processes it according to its established coding standards 
rules, and generates a java.lang.String result to report to users. For example, 
1 0 analyzer 160 may be used to test the HTML of the web page on servlet 140 for 
proper accessibility coding standards adherence. 

In general, embodiments of the present invention provide a servlet filter 
150 to operate in conjunction with the analyzer 160 of Figure 1. For example, 

1 5 when testing a web site, the developer 120 requests rendering of an HTML 
page by the embedded servlet 140. The server is configured to run a servlet 
filter 150 which intercepts the request made to the servlet 140 and the response 
provided by the servlet 140. The filter 150 accesses the content (e.g., HTML 
page or the like) generated by the embedded servlet 140, sends the HTML (or 

20 the like) to a hosted service (analyzer 160) for processing, and then produces 
the results of the analysis on the same page as the original content generated 
by the embedded servlet 140. Thus, the analysis results are generated 
immediately for the user. 
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In another embodiment, the developer 120 may also allow this function 
(filter 150) to be turned on or off as desired. Furthermore, system 100 provides 
a mapping from what a user is working on (developer 120 rapid application 
development item IDs) and what the servlet filter 150 reports (same IDs) so that 
5 the user can find and fix problems while remaining in the environment in which 
they are working. 

With reference now to Figure 2, a flowchart of steps performed in 
accordance with one embodiment of the present invention for analyzing 
1 0 character data is shown. In one embodiment, the steps of flowchart 200 are 

performed by a user on the developer's computing system 110 under computer 
control. 

Referring now to step 202 of Figure 2 and to Figure 1, in one 
1 5 embodiment, a development tool is opened on a developer's computing 

system. Although a developer's computing system (e.g., computing system 
110) is stated, the developer may be a manager, evaluator, tester, user, 
colleague, or the like, who has reason to have intimate access with the content 
to be analyzed. However, for purposes of clarity the term developer and /or 
20 user will be used for clarity herein. 

Referring now to step 204 of Figure 2 and to Figure 1, in one 
embodiment an embedded servlet is launched on the computing system 110. 
The embedded servlet is used to run the content the developer (or coder) has 
25 written or is testing. Although in one embodiment, the content being tested 
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may be new, in another embodiment, it may be content being updated or 
checked due to rule changes (or after a specific time frame, or for evaluation 
purposes, or the like). 



5 Referring now to step 206 of Figure 2 and to Figure 1, in one 

embodiment a filter 150 coupled with the servlet 140 is activated. In general, 
any page that is requested of the servlet 140 may go through any sequence of 
filters 150. If more than one filter 150 is present, the filters 150 are called in 
chain fashion (e.g., one to the next). 

10 

Referring now to step 208 of Figure 2 and to Figure 1, in one 
embodiment a browser 130 is launched on the developer's computing system 
110. For example, the browser 130 is launched on a local (e.g., internal) 
network and therefore needs no IP address. In one embodiment, the browser 
1 5 130 may be a network browser, Internet browser, personal browser, company 
browser, HTML display program, or the like, which is used to display web page 
content. 



Referring now to step 210 of Figure 2 and to Figure 1, in one 
20 embodiment the desired content (e.g., web page) from the servlet 140 is 

requested. The request is received from the browser 130. In one embodiment, 
the browser 130 and servlet 140 are operating on the same computing system 
110. 
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Referring now to step 212 of Figure 2 and to Figure 1, in one 
embodiment the content is generated on the embedded servlet 140. The 
content may be in the form of a web page. In one embodiment, if the filter 150 
is turned off by the developer the content is delivered to the browser 130 from 
the servlet 140 and the process is complete. This is in the same fashion as 
normal content reception on a network or web environment is transmitted, 
which is well known in the art. However, if the filter 150 is turned on, that is 
the content is to be analyzed, then the filter 150 requests the content to be 
generated by servlet 140, and intercepts the output for later processing. 



Referring now to step 214 of Figure 2 and to Figure 1, in one 
embodiment the filter 150 reads the content and saves the content to a file in 
HTML format. For example, the client browser 130 calls the servlet 140 and 
requests a page. The system 100 then configures a web.xml file such that the 
1 5 HTTP request does not go to the servlet 140. Instead, it goes to any filter that is 
specified in web.xml. One such filter is the servlet filter 150. 

Thus, the servlet filter 150 receives the HTTP request for the given page. 
It passes that request along the chain of filters to the next in line, which typically 

20 is the servlet 140 itself. The servlet 140 generates HTML as it normally would, 
for example, going through transformation steps from metadata through an 
HTML rendering engine and out to HTML. However, where the servlet 140 
would normally write to an output stream which communicates to the browser 
130, instead the servlet filter 150 has placed an interceding output stream, which 

25 captures the HTML in a file. Thus, when the servlet 140 is finished generating 
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HTML, it actually has written to the temporary file created by the servlet filter 
150. 



In one embodiment, there are two parameters particular to the filter 150 
5 that are passed as initial parameters via the web.xml. They are accessibility 
checker (AC) server and AC port. The AC server parameter values refer to a 
hostname of the machine where the analyzer 160 is running. The AC port 
parameter specifies the port number on which the analyzer 160 is listening. 
One exemplary embodiment of the code is shown herein. The OACFilter 
1 0 pattern is shown here as a wildcard, but could be made specific to OA.jsp or to a 
pattern, such as "*.jsp". 



! Service-based example of XML 
<filter> 

1 5 <filter-name>OACFilter</filter-name> 

<f ilter-class>oracle.ada.filter.O ACFilter< / f ilter-class> 
<init-param> 
<param-name>0 ACServer< / param-name> 
<param-value>oacserver.us.oracle.com< / param-value> 
20 < / init-param> 
<init-param> 
<par am-name>0 ACPort< / param-name> 
<param-value>7891< / param-value> 
</init-param> 
25 </filter> 

<filter-mapping> 
<f ilter-name>0 ACFilter< / f ilter-name> 
<url-pattern>/ OA.jsp</ url-pattern> 
</ filter-mapping> 

30 

In one embodiment, the filter 150 utilizes a Java application to act as a 
listener and server for processing the HTML pages and returning analysis 
results. For example, it may include an outside listener that listens to a specified 
socket. Upon receiving a request on the socket, it immediately forks a thread 
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off to process the page. The thread reads the HTML from the socket and stores 
the data in a file. When reading completes, it calls the analyzer 160 to analyze 
the page (e.g., HTML data). Once the java.land.String of results are received 
from the analyzer 160, they are sent back to the caller on the socket's output 
5 stream. Therefore, in one embodiment, the filter 150 does not use a database at 
all. It reads messages from a message file and maintains errors only 
temporarily in memory for reporting to a user. 

Referring now to step 216 of Figure 2 and to Figure 1, in one 
1 0 embodiment the filter 150 sends the saved content to the analyzer 160. The 
analyzer 160 then checks the content for errors. For example, if the analyzer 
160 is checking for accessibility (e.g., section 508 of the rehabilitation act), the 
content will be checked to ensure correct HTML tags exist for browsers which 
do not display images, or that text fields are descriptively labeled, or the like. In 
1 5 another embodiment, the analyzer 160 may check to ensure the content is 
coded to HTML standards, or the number of words on a page, or how many 
images, or proper structure, or if every page has required footnotes or headers 
(e.g., privacy notices). 

20 In yet another embodiment, the filter 150 and analyzer 160 may perform 

regression testing. That is, as a user clicks through links, the clicks are captured 
and a baseline is found. Then the clicks are replayed and analyzed at a later 
date to see if the results are better, worse, different, or the like. In a further 
embodiment, filter 150 may pass JPEG or MPEG files to the analyzer 160 for 

25 image analysis (e.g., sufficient contrast, red-green analysis, and the like). 
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In addition, due to the location of the filter 150, secure web page content 
may also be analyzed without requiring additional steps. For example, in the 
prior art, an analyzer accessed the server for the page and therefore if the page 
5 was encrypted the server sent the page content encrypted and it was the 
responsibility of the analyzer to decrypt and analyze the content. However, 
the filter 150 used in the present embodiment may receive and transmit the 
page content to the analyzer 160 prior to the encryption process. Therefore, 
not only is no decryption required by the analyzer 160, the resulting 
1 0 information from the analyzer 160 is appended to the page prior to the 
encryption of the page. Thus, the results appended to the page are also 
encrypted when received to the browser 130. 

In one embodiment, the filter 150 includes lines of content, possibly in 
1 5 web.xml which directs the filter 150 to the location of the analyzer 160. 

Therefore, there is no need for the analyzer to search for the location of the 
machine having the content to be tested since the machine having the content 
to be tested is accessing the analyzer 160. In addition, if the analyzer 160 
changes location, the filter 150 can be updated to the new location of the 
20 analyzer 160. 

The filter 150 may also include extensible protocols for other possible 
future updates. For example, the filter 150 may be extended to inform the 
analyzer 160 to only fun the section 508 rules not international rules or to 
25 output the results in text instead of HTML or other language. In addition, the 
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HTML from the filter 150 to the analyzer 160 may include a tag from the filter 
150 telling the analyzer 160 what to analyze or how to analyze, or specific items 
on a per user or per page basis. The web.xml of the filter 150 may also be able 
to discern if the analyzer 160 is non-operational and post results on the browser 
5 130 accordingly, thereby preventing a non-operational analyzer 160 from 
crashing the computing system 110. 

Referring now to step 218 of Figure 2 and to Figure 1, in one 
embodiment, the results from the analyzer 160 are returned to the servlet 140 
1 0 and appended to the original content. For example, analyzer 160 returns the 
string of analysis results to the servlet filter 150. The filter appends the results 
to the web page that the servlet 140 itself produced, and sends the resulting 
HTML to the browser 130 at the client tier. In one embodiment, the library for 
this is named oacfilter.jar. 

15 

In one embodiment, the servlet filter 150 component is installed 
alongside developer 120 and the computing system 110 environment is 
configured to include the oacfilter.jar library. To configure the servlet 140, the 
web.xml file in use by computing system 110 (under the OA_HTML directory) 

20 is set up to run the filter 150 and pass appropriate parameters to it. When 

computing system 110 is set up by users not to perform accessibility checking, 
the web.xml file may have no mention of filter 150 configurations. This will 
withhold the filter 150 from the embedded servlet 140 runtime environment 
when the user is doing other work. In one embodiment, in order to perform 

25 the analysis or testing, the web site is run within its own internal server. 
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In one embodiment, if the content is a web page, then the results of the 
analyzer 160 will be appended to the page. The appending may occur at the 
bottom as shown in Figure 4, the appending may occur to the side of the page 
5 or to the top, or may be adjustable based on the preference of the user. 



Referring now to step 220 of Figure 2 and to Figure 1, in one 
embodiment the content is returned to the browser 130. Therefore, in one 
embodiment, when content is requested to be analyzed the user is shown the 

1 0 web page on the browser 130 in a true to life rendition. In addition, the page 
will also show any detected mistakes or errors in code on the page. Therefore, 
if further content needs to be checked (e.g., a plurality of linked web pages), a 
user may simply click on the link to the next page and, as long as the filter 150 is 
active, the next page will be displayed and any errors recognized by the 

1 5 analyzer 160 regarding the new page will be appended to the new page. In 
addition to maintaining the error results on the page having the errors, in one 
embodiment, the user need also only maintain one copy of the rules being 
evaluated. For example, if the rules are updated, it is simply a matter of 
updating the analyzer 160. Then, each page can be re-checked against the now 

20 new rules thereby saving deployment of software on a developer's computing 
system. 

With reference now to Figure 3, a diagram of an exemplary web page 
layout is shown in accordance with an embodiment of the present invention. 
25 That is, web page 300 illustrates one embodiment for presenting the content 



ORCL-2003-1 56-01 /ACM/JDY 



-19- 



ORACLE CONFIDENTIAL 

310 described herein. For example, the content 310 of web page 300 is shown 
as if the filter 150 of Figure 1 were turned off by the developer. Moreover, the 
content 310 of web page 300 is shown in the same format as it will be displayed 
once the content 310 is released to the network. 

5 

With reference now to Figure 4, a diagram of an exemplary web page 
layout with appended content analysis results is shown in accordance with an 
embodiment of the present invention. For example, the content 310 of web 
page 400 is shown as if the filter 150 of Figure 1 were operating and the 

1 0 analyzer 160 had found errors and returned them as results 410 amended to the 
content 310. Moreover, the content 310 of web page 400 is shown as it will be 
once the content 310 is released to the network. Therefore, a user is able to 
compare the results 410 with the content 310 while also being able to view and 
interact with the content 310 displayed on web page 400 (e.g., link access, and 

1 5 the like). 

Thus, in one embodiment, the amended results 410 include columns for 
priority 415, section 420, description 425, and line /character position 430. 
Priority 415 portrays the importance of the error found, section 420 quotes the 
20 rule or section of rules which guide the logic behind the error, description 425 
describes the error, and line/ char position 430 lists the location of the error in 
the code for the page. Although the four columns are shown in the amended 
results 410, there may be more or fewer columns having more, fewer, and /or 
different information displayed. Additionally, the format of the amended 
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results 410 may be organized in a plurality of different formats or priorities as 
desired by a user. 

In one embodiment, to test for accessibility, a user can set one or more 
5 flags on the computing system 110 and turn on the applications own 

accessibility mode, which causes a filter to generate accessible HTML. With the 
filter 150 turned on, the analysis results of a generated page's HTML will be 
presented at the bottom of the user's page. In some cases, this may require the 
user to scroll down or over to see the information. The information (e.g., 

1 0 results from analyzer) is sufficient to allow a user to find out the root cause of 
any errors or problems. In one embodiment, the analyzer 160 will also include 
HTML ID attributes that are associated with HTML elements that exhibit 
problems, and those ID attribute values will map to metadata in the extension 
framework. The report from the analyzer 160 may further indicate where in 

1 5 the generated HTML the problem is occurring, and provide the metadata 
identifier. In another embodiment, a link from each reported violation will 
allow a user to refer to coding examples of how to fix problems in HTML and 
metadata. Then, the user may fix the code in the metadata, and rerun the 
application to see the effects. As stated herein, the user may also click between 

20 pages within the scope of the developer 120 in the course of testing. 

In one embodiment, if the analyzer becomes outdated or is inaccessible 
with respect to the filter 150, an error message will be generated informing the 
user that an improper HTML response was received from the analyzer 160. 
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This may occur if the filter 150 is updated to expect a certain response for the 
analyzer 160, yet the analyzer 160 has not yet been upgraded. 

With reference now to Figure 5, a block diagram of an embodiment of 
5 an exemplary computer system 110 used in accordance with the present 

invention. It should be appreciated that system 110 is not strictly limited to be a 
computer system. As such, system 110 of the present embodiment is well 
suited to be any type of computing device (e.g., server computer, portable 
computing device, desktop computer, mobile phone, pager, personal digital 

1 0 assistant, etc.). Within the following discussions of the present invention, 

certain processes and steps are discussed that are realized, in one embodiment, 
as a series of instructions (e.g., software program) that reside within computer 
readable memory units of computer system 110 and executed by a processor (s) 
of system 110. When executed, the instructions cause computer 110 to perform 

1 5 specific actions and exhibit specific behavior that is described in detail herein. 

Computer system 110 of Figure 5 comprises an address/ data bus 510 for 
communicating information, one or more central processors 502 coupled with 
bus 510 for processing information and instructions. Central processor unit(s) 

20 502 may be a microprocessor or any other type of processor. The computer 
110 also includes data storage features such as a computer usable volatile 
memory unit 504 (e.g., random access memory, static RAM, dynamic RAM, 
etc.) coupled with bus 510 for storing information and instructions for central 
processor(s) 502, a computer usable non-volatile memory unit 506 (e.g., read 

25 only memory, programmable ROM, flash memory, EPROM, EEPROM, etc.) 
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coupled with bus 510 for storing static information and instructions for 
processor (s) 502. System 110 also includes one or more signal generating and 
receiving devices 508 coupled with bus 510 for enabling system 110 to interface 
with other electronic devices and computer systems. The communication 
5 interface(s) 508 of the present embodiment may include wired and /or wireless 
communication technology. For example, within the present embodiment, the 
communication interface 508 may be a serial communication port, a Universal 
Serial Bus (USB), an Ethernet adapter, a FireWire (IEEE 1394) interface, a parallel 
port, a small computer system interface (SCSI) bus interface, infrared (IR) 
1 0 communication port, Bluetooth wireless communication port, a broadband 
interface, or an interface to the Internet, among others. 

Optionally, computer system 110 may include an alphanumeric input 
device 514 including alphanumeric and function keys coupled to the bus 510 for 

1 5 communicating information and command selections to the central processor(s) 
502. The computer 110 can include an optional cursor control or cursor 
directing device 516 coupled to the bus 510 for communicating user input 
information and command selections to the central processor(s) 502. The 
cursor-directing device 516 may be implemented using a number of well- 

20 known devices such as a mouse, a track-ball, a track-pad, an optical tracking 

device, and a touch screen, among others. Alternatively, it is appreciated that a 
cursor may be directed and/ or activated via input from the alphanumeric input 
device 514 using special keys and key sequence commands. The present 
embodiment is also well suited to directing a cursor by other means such as, for 

25 example, voice commands. 
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The system 110 of Figure 5 may also include one or more optional 
computer usable data storage devices 518 such as a magnetic or optical disk and 
disk drive (e.g., hard drive or floppy diskette) coupled with bus 510 for storing 
5 information and instructions. An optional display device 512 is coupled to bus 
510 of system 110 for displaying video and /or graphics. It should be 
appreciated that optional display device 512 may be a cathode ray tube (CRT), 
flat panel liquid crystal display (LCD), field emission display (FED), plasma 
display or any other display device suitable for displaying video and /or graphic 
1 0 images and alphanumeric characters recognizable to a user. 

Thus, the present invention provides, in various embodiments, a method 
and system for analyzing character data. Furthermore, the present invention 
provides a method and system for analyzing character data and partitions the 
1 5 character data string into either single-byte character data string or multi-byte 
data. Additionally, the present invention provides a method and system for 
analyzing character data faster and less expensively. 

The foregoing descriptions of specific embodiments of the present 
20 invention have been presented for purposes of illustration and description. 
They are not intended to be exhaustive or to limit the invention to the precise 
forms disclosed, and obviously many modifications and variations are possible 
in light of the above teaching. The embodiments were chosen and described in 
order to best explain the principles of the invention and its practical application, 
25 to thereby enable others skilled in the art to best utilize the invention and 



ORCL-2003-1 56-01 /ACM/JDY 



-24- 



ORACLE CONFIDENTIAL 

various embodiments with various modifications are suited to the particular 
use contemplated. It is intended that the scope of the invention be defined by 
the Claims appended hereto and their equivalents. 
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